1. Introduction

In this notebook, I examine the relationship between utterance fluency (UF) measures and L2 speaking anxiety, addressing the following three research questions (RQs):

  • RQ1: To what extent are speed fluency measures associated with cognitive, somatic, and behavioral speaking anxiety?

  • RQ2: To what extent are breakdown fluency measures associated with cognitive, somatic, and behavioral speaking anxiety?

  • RQ3: To what extent are repair fluency measures associated with cognitive, somatic, and behavioral speaking anxiety?

In the following sections, I preprocess data, conduct preliminary analyses, and construct regression models.

The following code block loads R packages required for analyses.

pacman::p_load(
  tidyverse, here, skimr, psych, 
  PerformanceAnalytics, sjPlot, performance
)

2. Data Preprocess

This section tidies data. First of all, the following code block loads data of UF measures.

df_uf_raw <- read.csv(here("data", "processed", "uf_measures.csv"))

The current analysis focuses on the UF measures shown in Table 1.

Table 1. UF Dimension and Corresponding Measures

UF Dimension Measure
Speed Fluency (SF) Articulation Rate (AR)
Breakdown Fluency (BDF) Mid-Clause Pause Ratio (MCPR)
End-Clause Pause Ratio (ECPR)
Mid-Clause Pause Duration (MCPD)
End-Clause Pause Duration (ECPD)
Repair Fluency (RF) Disfluency Ratio

Thus, the following code block selects target measures. The code block also renames the columns filename and dysfluency_ratio to participant_id and disfluency_ratio, respectively.

df_uf_raw %>%
  select(
    filename,
    articulation_rate,
    mid_clause_pause_ratio,
    end_clause_pause_ratio,
    mid_clause_p.dur,
    end_clause_p.dur,
    dysfluency_ratio
  ) %>%
  rename(
    participant_id = filename,
    disfluency_ratio = dysfluency_ratio
  ) -> df_uf_tidy

df_uf_tidy

Next, the following code block loads data of post questionnaire about anxiety and linguistic background.

df_questionnaire_raw <- read.csv(
  here("data", "raw", "post_questionnaire.csv")
)

The following code block selects columns related to participant ID, anxiety, L2 English proficiency, and other backgrounds and renames them.

df_questionnaire_raw %>%
  select(
    参加者IDを入力してください.半角..,
    英語を話すとき.私は言葉を間違えてしまうのではとしばしば心配になる.,
    英語を話すとき.私は自分の話し方が他人より劣っているのではとしばしば心配になる.,
    英語を話し始めるとすぐに.私は自分自身のことを表現できないのではと心配になり始める.,
    英語を話すとき.私はドキドキすることがよくある.,
    英語を話すとき.私は緊張でびくびくすることがよくある.,
    英語を話すとき.私はよく汗をかいたり.発汗する.,
    英語スピーキングの練習をするとき.私はしばしば.すぐに投げ出してしまう.,
    私は英語スピーキングの練習の手を抜くことが多い.,
    私は普段.極力英語を話さないようにしている.,
    お持ちでしたら.英検のスコアを記入してください.,
    お持ちでしたら.IELTS..Overall..のスコアを記入してください.,
    お持ちでしたら.TOEFL.iBT.のスコアを記入してください.,
    お持ちでしたら.TOEIC.のスコアを記入してください.,
    性別,
    年齢
  ) %>%
  rename(
    participant_id = 参加者IDを入力してください.半角..,
    cognitive_anxiety_1 = 英語を話すとき.私は言葉を間違えてしまうのではとしばしば心配になる.,
    cognitive_anxiety_2 = 英語を話すとき.私は自分の話し方が他人より劣っているのではとしばしば心配になる.,
    cognitive_anxiety_3 = 英語を話し始めるとすぐに.私は自分自身のことを表現できないのではと心配になり始める.,
    somatic_anxiety_1 = 英語を話すとき.私はドキドキすることがよくある.,
    somatic_anxiety_2 = 英語を話すとき.私は緊張でびくびくすることがよくある.,
    somatic_anxiety_3 = 英語を話すとき.私はよく汗をかいたり.発汗する.,
    behavioral_anxiety_1 = 英語スピーキングの練習をするとき.私はしばしば.すぐに投げ出してしまう.,
    behavioral_anxiety_2 = 私は英語スピーキングの練習の手を抜くことが多い.,
    behavioral_anxiety_3 = 私は普段.極力英語を話さないようにしている.,
    IELTS = お持ちでしたら.IELTS..Overall..のスコアを記入してください.,
    TOEFL_IBT = お持ちでしたら.TOEFL.iBT.のスコアを記入してください.,
    TOEIC = お持ちでしたら.TOEIC.のスコアを記入してください.,
    eiken = お持ちでしたら.英検のスコアを記入してください.,
    sex = 性別,
    age = 年齢
  ) %>%
  mutate(
    sex = str_replace_all(sex, "男性", "male"),
    sex = str_replace_all(sex, "女性", "female"),
    eiken = str_replace_all(eiken, "準1級", "1.5"),
    eiken = str_replace_all(eiken, "準1級", "1.5"),
    eiken = str_replace_all(eiken, "2級", "2"),
    eiken = str_replace_all(eiken, "1級", "1"),
    eiken = str_replace_all(eiken, "英検4級", "4"),
    eiken = as.numeric(eiken)
  ) -> df_questionnaire_tidy

df_questionnaire_tidy

3. Preliminary Analysis

3.1. Participant Background

The following code block calculates the descriptive statistics of participants’ age.

df_questionnaire_tidy %>%
  select(age) %>%
  skim()
── Data Summary ────────────────────────
                           Values    
Name                       Piped data
Number of rows             10        
Number of columns          1         
_______________________              
Column type frequency:               
  numeric                  1         
________________________             
Group variables            None      

The result showed that participants’ age was ranged \([20, 38]\), and their mean and SD were \(28.8\) and \(5.613\).

The following code block counts sex of participants.

df_questionnaire_tidy %>%
  group_by(sex) %>%
  summarize(N = n())

The result indicated that there were equal number of male and female participants.

3.2. Proficiency Levels

The following code block converts L2 English assessment scores to CEFR.

cefr_levels = c("A1", "A2", "B1", "B2", "C1", "C2")

df_questionnaire_tidy %>%
  select(participant_id, TOEFL_IBT, IELTS, TOEIC, eiken) %>%
  mutate(
    TOEFL_IBT = case_when(
      TOEFL_IBT >= 114 ~ "C2",
      TOEFL_IBT >= 95 ~ "C1",
      TOEFL_IBT >= 72 ~ "B2"
    ),
    IELTS = case_when(
      IELTS >= 7 ~ "C1"
    ),
    TOEIC = case_when(
      TOEIC >= 945 ~ "C1",
      TOEIC >= 785 ~ "B1",
      TOEIC >= 550 ~ "B2"
    ),
    eiken = case_when(
      eiken >= 3 ~ "A1",
      eiken >= 2 ~ "B1",
      eiken >= 1.5 ~ "B2",
      eiken >= 1 ~ "C1"
    )
  ) %>%
  mutate(
    TOEFL_IBT = factor(TOEFL_IBT, levels = cefr_levels, order = T),
    IELTS = factor(IELTS, levels = cefr_levels, order = T),
    TOEIC = factor(TOEIC, levels = cefr_levels, order = T),
    eiken = factor(eiken, levels = cefr_levels, order = T)
  ) %>% 
  replace_na(list(
    TOEFL_IBT = "A1",
    IELTS = "A1",
    TOEIC = "A1",
    eiken = "A1"
  )) %>%
  group_by(participant_id) %>%
  summarize(CEFR = max(TOEFL_IBT, IELTS, TOEIC, eiken)) %>%
  group_by(CEFR) %>%
  summarize(N = n())

The result suggested that most participants were advanced-level L2 English learners (\(N=6\)), while other four participants were intermediate-level English learners.

3.3. UF Measures

The following code block calculates the descriptive statistics of UF measures.

df_uf_tidy %>%
  select(-c(participant_id)) %>%
  skim()
── Data Summary ────────────────────────
                           Values    
Name                       Piped data
Number of rows             10        
Number of columns          6         
_______________________              
Column type frequency:               
  numeric                  6         
________________________             
Group variables            None      

3.4. Speaking Anxiety

The following code blocks calculate the reliability of anxiety questionnaire answers in terms of Cronbach’s \(\alpha\).

3.4.1. Reliability of Cognitive Anxiety

df_questionnaire_tidy %>% 
  select(
    cognitive_anxiety_1,
    cognitive_anxiety_2,
    cognitive_anxiety_3
  ) %>%
  alpha()

Reliability analysis   
Call: alpha(x = .)

 

    95% confidence boundaries 

 Reliability if an item is dropped:

 Item statistics 

Non missing response frequency for each item
                      1   2   3   4   5 miss
cognitive_anxiety_1 0.0 0.3 0.1 0.3 0.3    0
cognitive_anxiety_2 0.1 0.1 0.0 0.6 0.2    0
cognitive_anxiety_3 0.2 0.3 0.0 0.2 0.3    0

The Cornbach’s \(\alpha\) of cognitive anxiety questionnaires was \(.747\), indicating an acceptable internal consistency

3.4.2. Reliability of Somatic Anxiety

df_questionnaire_tidy %>% 
  select(
    somatic_anxiety_1,
    somatic_anxiety_2,
    somatic_anxiety_3
  ) %>%
  alpha()

Reliability analysis   
Call: alpha(x = .)

 

    95% confidence boundaries 

 Reliability if an item is dropped:

 Item statistics 

Non missing response frequency for each item
                    1   2   3   4   5 miss
somatic_anxiety_1 0.2 0.3 0.1 0.2 0.2    0
somatic_anxiety_2 0.3 0.2 0.2 0.0 0.3    0
somatic_anxiety_3 0.3 0.3 0.0 0.4 0.0    0

The Cornbach’s \(\alpha\) of somatic anxiety questionnaires was \(.900\), indicating a excellent internal consistency.

3.4.3. Behavioral Anxiety

df_questionnaire_tidy %>% 
  select(
    behavioral_anxiety_1,
    behavioral_anxiety_2,
    behavioral_anxiety_3
  ) %>%
  alpha()

Reliability analysis   
Call: alpha(x = .)

 

    95% confidence boundaries 

 Reliability if an item is dropped:

 Item statistics 

Non missing response frequency for each item
                       1   2   3   4   5 miss
behavioral_anxiety_1 0.2 0.4 0.2 0.1 0.1    0
behavioral_anxiety_2 0.2 0.3 0.1 0.3 0.1    0
behavioral_anxiety_3 0.4 0.3 0.2 0.1 0.0    0

The Cornbach’s \(\alpha\) of somatic anxiety questionnaires was \(.814\), indicating a good internal consistency.

3.5. Correlations

This subsection conducts correlation analyses between UF measures and speaking anxiety. Before the analyses, the following code block calculates three anxiety scores by summing items and joins the two dataframes.

df_questionnaire_tidy %>%
  group_by(participant_id) %>%
  summarize(
    cognitive_anxiety = sum(
      cognitive_anxiety_1, cognitive_anxiety_2, cognitive_anxiety_3
    ),
    somatic_anxiety = sum(
      somatic_anxiety_1, somatic_anxiety_2, somatic_anxiety_3
    ),
    behavioral_anxiety = sum(
      behavioral_anxiety_1, behavioral_anxiety_2, behavioral_anxiety_3
    )
  ) %>% 
  inner_join(
    df_uf_tidy, by = "participant_id"
  ) -> df_uf_anxiety

df_uf_anxiety

3.5.1. Correlations Among Anxiety Scores

The following code block generates a correlation matrix of three anxiety scores.

df_uf_anxiety %>%
  select(
    cognitive_anxiety,
    somatic_anxiety,
    behavioral_anxiety
  ) %>%
  chart.Correlation(histogram = T, method = "pearson", pch = 19)
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter

The result showed that significant strong correlation between cognitive and somatic anxiety (\(r=.78\)). In addition, there was slightly significant medium correlation between somatic and behavioral anxiety (\(r=.59\)). Meanwhile, there was small correlation between cognitive and behavioral anxiety (\(r=.48\)), but statistical significance was not found.

3.5.2. Correlations Between UF Measures and Cognitive Anxiety

The following code block generates a correlation matrix of UF measures and cognitive anxiety.

df_uf_anxiety %>%
  select(-c(
    participant_id, 
    somatic_anxiety, 
    behavioral_anxiety
  )) %>%
  chart.Correlation(histogram = T, method = "pearson", pch = 19)
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter

The result showed significant large correlation between cognitive anxiety and ECPR, suggesting that participants produced end-clause pauses more frequently when cognitive anxiety was lower.

3.5.3. Correlations Between UF Measures and Somatic Anxiety

The following code block generates a correlation matrix of UF measures and somatic anxiety.

df_uf_anxiety %>%
  select(-c(
    participant_id, 
    cognitive_anxiety, 
    behavioral_anxiety
  )) %>%
  chart.Correlation(histogram = T, method = "pearson", pch = 19)
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter

The result did not show significant correlations between somatic anxiety and UF measures.

3.5.4. Correlations Between UF Measures and Behavioral Anxiety

The following code block generates a correlation matrix of UF measures and behavioral anxiety.

df_uf_anxiety %>%
  select(-c(
    participant_id, 
    cognitive_anxiety, 
    somatic_anxiety
  )) %>%
  chart.Correlation(histogram = T, method = "pearson", pch = 19)
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter
Warning in par(usr) : argument 1 does not name a graphical parameter

The result identified significant large correlations between behavioral anxiety and MCPD (\(r=-.76\)) and ECPD (\(r=.79\)), suggesting that participants who perceived lower behavioral anxiety generate longer MCPD and ECPD.


4. Regression Analyses

4.1. SF ~ Speaking Anxiety

The following code block constructs a regression model between AR and speaking anxiety.

model_ar <- lm(
  articulation_rate ~ cognitive_anxiety + somatic_anxiety + behavioral_anxiety,
  df_uf_anxiety
)

The following code block shows the result.

tab_model(model_ar, digits = 3)

The result did not show significant relationship between three anxiety scores and AR.

To examine the reliability of the current model, the following code block conducts post-hoc analysis of regression assumptions.

check_model(model_ar)

The results suggested that the current model could be reliable.

4.2. BDF ~ Speaking Anxiety

4.2.1. MCPR ~ Speaking Anxiety

The following code block constructs a regression model between MCPR and speaking anxiety.

model_mcpr <- lm(
  mid_clause_pause_ratio ~ cognitive_anxiety + somatic_anxiety + behavioral_anxiety,
  df_uf_anxiety
)

The following code block shows the result.

tab_model(model_mcpr, digits = 3)

The result did not show significant relationship between three anxiety scores and MCPR.

To examine the reliability of the current model, the following code block conducts post-hoc analysis of regression assumptions.

check_model(model_mcpr)

The results suggested that the current model could be less reliable because the model could be affected by an outlier.

4.2.2. ECPR ~ Speaking Anxiety

The following code block constructs a regression model between ECPR and speaking anxiety.

model_ecpr <- lm(
  end_clause_pause_ratio ~ cognitive_anxiety + somatic_anxiety + behavioral_anxiety,
  df_uf_anxiety
)

The following code block shows the result.

tab_model(model_ecpr, digits = 3)

The result showed slightly significant relationship between cognitive anxiety and ECPR (\(p=.081\)). The slope was \(-.003\) and suggested that ECPR decreased \(.003\) when cognitive anxiety increased \(1\).

To examine the reliability of the current model, the following code block conducts post-hoc analysis of regression assumptions.

check_model(model_ecpr)

The results suggested that the current model could be reliable.

4.2.3. MCPD ~ Speaking Anxiety

The following code block constructs a regression model between MCPD and speaking anxiety.

model_mcpd <- lm(
  mid_clause_p.dur ~ cognitive_anxiety + somatic_anxiety + behavioral_anxiety,
  df_uf_anxiety
)

The following code block shows the result.

tab_model(model_mcpd, digits = 3)

The result showed a significant link between behavioral anxiety and MCPD (\(p=.025\)). The slope was \(-.074\), suggesting that MCPD became \(.074\) shorter when behavioral anxiety score increased \(1\).

To examine the reliability of the current model, the following code block conducts post-hoc analysis of regression assumptions.

check_model(model_mcpd)

The results suggested that the current model could be reliable.

4.2.4. ECPD ~ Speaking Anxiety

The following code block constructs a regression model between AR and speaking anxiety.

model_enpd <- lm(
  end_clause_p.dur ~ cognitive_anxiety + somatic_anxiety + behavioral_anxiety,
  df_uf_anxiety
)

The following code block shows the result.

tab_model(model_enpd, digits = 3)

The result showed a significant link between behavioral anxiety and MCPD (\(p=.020\)). The slope was \(-.072\), suggesting that ECPD became \(.072\) shorter when behavioral anxiety score increased \(1\).

To examine the reliability of the current model, the following code block conducts post-hoc analysis of regression assumptions.

check_model(model_enpd)

The results suggested that the current model could be reliable.

4.3. RF ~ Speaking Anxiety

The following code block constructs a regression model between DR and speaking anxiety.

model_dr <- lm(
  disfluency_ratio ~ cognitive_anxiety + somatic_anxiety + behavioral_anxiety,
  df_uf_anxiety
)

The following code block shows the result.

tab_model(model_dr, digits = 3)

The result did not show significant relationship between three anxiety scores and DR.

To examine the reliability of the current model, the following code block conducts post-hoc analysis of regression assumptions.

check_model(model_dr)

Quitting from lines 526-527 [unnamed-chunk-35] (final_report_analysis.Rmd)

The results suggested that the current model could be less reliable because the model could be affected by an outlier.

---
title: 'IntroR: Final Report Analysis'
author: "Ryuki Matsuura"
date: "2024/11/27"
output:
  html_document:
    toc: true
    df_print: paged
  html_notebook:
    toc: true
    toc_float: true
    theme: flatly
---

## 1. Introduction

In this notebook, I examine the relationship between utterance fluency (UF) measures and L2 speaking anxiety, addressing the following three research questions (RQs):

-   RQ1: To what extent are speed fluency measures associated with cognitive, somatic, and behavioral speaking anxiety?

-   RQ2: To what extent are breakdown fluency measures associated with cognitive, somatic, and behavioral speaking anxiety?

-   RQ3: To what extent are repair fluency measures associated with cognitive, somatic, and behavioral speaking anxiety?

In the following sections, I preprocess data, conduct preliminary analyses, and construct regression models.

The following code block loads R packages required for analyses.

```{r}
pacman::p_load(
  tidyverse, here, skimr, psych, 
  PerformanceAnalytics, sjPlot, performance
)
```

------------------------------------------------------------------------

## 2. Data Preprocess

This section tidies data. First of all, the following code block loads data of UF measures.

```{r}
df_uf_raw <- read.csv(here("data", "processed", "uf_measures.csv"))
```

The current analysis focuses on the UF measures shown in Table 1.

Table 1. UF Dimension and Corresponding Measures

| UF Dimension            | Measure                          |
|-------------------------|----------------------------------|
| Speed Fluency (SF)      | Articulation Rate (AR)           |
| Breakdown Fluency (BDF) | Mid-Clause Pause Ratio (MCPR)    |
|                         | End-Clause Pause Ratio (ECPR)    |
|                         | Mid-Clause Pause Duration (MCPD) |
|                         | End-Clause Pause Duration (ECPD) |
| Repair Fluency (RF)     | Disfluency Ratio                 |

Thus, the following code block selects target measures. The code block also renames the columns `filename` and `dysfluency_ratio` to `participant_id` and `disfluency_ratio`, respectively.

```{r}
df_uf_raw %>%
  select(
    filename,
    articulation_rate,
    mid_clause_pause_ratio,
    end_clause_pause_ratio,
    mid_clause_p.dur,
    end_clause_p.dur,
    dysfluency_ratio
  ) %>%
  rename(
    participant_id = filename,
    disfluency_ratio = dysfluency_ratio
  ) -> df_uf_tidy

df_uf_tidy
```

Next, the following code block loads data of post questionnaire about anxiety and linguistic background.

```{r}
df_questionnaire_raw <- read.csv(
  here("data", "raw", "post_questionnaire.csv")
)
```

The following code block selects columns related to participant ID, anxiety, L2 English proficiency, and other backgrounds and renames them.

```{r}
df_questionnaire_raw %>%
  select(
    参加者IDを入力してください.半角..,
    英語を話すとき.私は言葉を間違えてしまうのではとしばしば心配になる.,
    英語を話すとき.私は自分の話し方が他人より劣っているのではとしばしば心配になる.,
    英語を話し始めるとすぐに.私は自分自身のことを表現できないのではと心配になり始める.,
    英語を話すとき.私はドキドキすることがよくある.,
    英語を話すとき.私は緊張でびくびくすることがよくある.,
    英語を話すとき.私はよく汗をかいたり.発汗する.,
    英語スピーキングの練習をするとき.私はしばしば.すぐに投げ出してしまう.,
    私は英語スピーキングの練習の手を抜くことが多い.,
    私は普段.極力英語を話さないようにしている.,
    お持ちでしたら.英検のスコアを記入してください.,
    お持ちでしたら.IELTS..Overall..のスコアを記入してください.,
    お持ちでしたら.TOEFL.iBT.のスコアを記入してください.,
    お持ちでしたら.TOEIC.のスコアを記入してください.,
    性別,
    年齢
  ) %>%
  rename(
    participant_id = 参加者IDを入力してください.半角..,
    cognitive_anxiety_1 = 英語を話すとき.私は言葉を間違えてしまうのではとしばしば心配になる.,
    cognitive_anxiety_2 = 英語を話すとき.私は自分の話し方が他人より劣っているのではとしばしば心配になる.,
    cognitive_anxiety_3 = 英語を話し始めるとすぐに.私は自分自身のことを表現できないのではと心配になり始める.,
    somatic_anxiety_1 = 英語を話すとき.私はドキドキすることがよくある.,
    somatic_anxiety_2 = 英語を話すとき.私は緊張でびくびくすることがよくある.,
    somatic_anxiety_3 = 英語を話すとき.私はよく汗をかいたり.発汗する.,
    behavioral_anxiety_1 = 英語スピーキングの練習をするとき.私はしばしば.すぐに投げ出してしまう.,
    behavioral_anxiety_2 = 私は英語スピーキングの練習の手を抜くことが多い.,
    behavioral_anxiety_3 = 私は普段.極力英語を話さないようにしている.,
    IELTS = お持ちでしたら.IELTS..Overall..のスコアを記入してください.,
    TOEFL_IBT = お持ちでしたら.TOEFL.iBT.のスコアを記入してください.,
    TOEIC = お持ちでしたら.TOEIC.のスコアを記入してください.,
    eiken = お持ちでしたら.英検のスコアを記入してください.,
    sex = 性別,
    age = 年齢
  ) %>%
  mutate(
    sex = str_replace_all(sex, "男性", "male"),
    sex = str_replace_all(sex, "女性", "female"),
    eiken = str_replace_all(eiken, "準1級", "1.5"),
    eiken = str_replace_all(eiken, "準１級", "1.5"),
    eiken = str_replace_all(eiken, "2級", "2"),
    eiken = str_replace_all(eiken, "1級", "1"),
    eiken = str_replace_all(eiken, "英検4級", "4"),
    eiken = as.numeric(eiken)
  ) -> df_questionnaire_tidy

df_questionnaire_tidy
```

------------------------------------------------------------------------

## 3. Preliminary Analysis

### 3.1. Participant Background

The following code block calculates the descriptive statistics of participants' age.

```{r}
df_questionnaire_tidy %>%
  select(age) %>%
  skim()
```

The result showed that participants' age was ranged $[20, 38]$, and their mean and SD were $28.8$ and $5.613$.

The following code block counts sex of participants.

```{r}
df_questionnaire_tidy %>%
  group_by(sex) %>%
  summarize(N = n())
```

The result indicated that there were equal number of male and female participants.

### 3.2. Proficiency Levels

The following code block converts L2 English assessment scores to CEFR.

```{r}
cefr_levels = c("A1", "A2", "B1", "B2", "C1", "C2")

df_questionnaire_tidy %>%
  select(participant_id, TOEFL_IBT, IELTS, TOEIC, eiken) %>%
  mutate(
    TOEFL_IBT = case_when(
      TOEFL_IBT >= 114 ~ "C2",
      TOEFL_IBT >= 95 ~ "C1",
      TOEFL_IBT >= 72 ~ "B2"
    ),
    IELTS = case_when(
      IELTS >= 7 ~ "C1"
    ),
    TOEIC = case_when(
      TOEIC >= 945 ~ "C1",
      TOEIC >= 785 ~ "B1",
      TOEIC >= 550 ~ "B2"
    ),
    eiken = case_when(
      eiken >= 3 ~ "A1",
      eiken >= 2 ~ "B1",
      eiken >= 1.5 ~ "B2",
      eiken >= 1 ~ "C1"
    )
  ) %>%
  mutate(
    TOEFL_IBT = factor(TOEFL_IBT, levels = cefr_levels, order = T),
    IELTS = factor(IELTS, levels = cefr_levels, order = T),
    TOEIC = factor(TOEIC, levels = cefr_levels, order = T),
    eiken = factor(eiken, levels = cefr_levels, order = T)
  ) %>% 
  replace_na(list(
    TOEFL_IBT = "A1",
    IELTS = "A1",
    TOEIC = "A1",
    eiken = "A1"
  )) %>%
  group_by(participant_id) %>%
  summarize(CEFR = max(TOEFL_IBT, IELTS, TOEIC, eiken)) %>%
  group_by(CEFR) %>%
  summarize(N = n())
```

The result suggested that most participants were advanced-level L2 English learners ($N=6$), while other four participants were intermediate-level English learners.

### 3.3. UF Measures

The following code block calculates the descriptive statistics of UF measures.

```{r}
df_uf_tidy %>%
  select(-c(participant_id)) %>%
  skim()
```

### 3.4. Speaking Anxiety

The following code blocks calculate the reliability of anxiety questionnaire answers in terms of Cronbach's $\alpha$.

#### *3.4.1. Reliability of Cognitive Anxiety*

```{r}
df_questionnaire_tidy %>% 
  select(
    cognitive_anxiety_1,
    cognitive_anxiety_2,
    cognitive_anxiety_3
  ) %>%
  alpha()
```

The Cornbach's $\alpha$ of cognitive anxiety questionnaires was $.747$, indicating an acceptable internal consistency

#### *3.4.2. Reliability of Somatic Anxiety*

```{r}
df_questionnaire_tidy %>% 
  select(
    somatic_anxiety_1,
    somatic_anxiety_2,
    somatic_anxiety_3
  ) %>%
  alpha()
```

The Cornbach's $\alpha$ of somatic anxiety questionnaires was $.900$, indicating a excellent internal consistency.

#### 3.4.3. Behavioral Anxiety

```{r}
df_questionnaire_tidy %>% 
  select(
    behavioral_anxiety_1,
    behavioral_anxiety_2,
    behavioral_anxiety_3
  ) %>%
  alpha()
```

The Cornbach's $\alpha$ of somatic anxiety questionnaires was $.814$, indicating a good internal consistency.

### 3.5. Correlations

This subsection conducts correlation analyses between UF measures and speaking anxiety. Before the analyses, the following code block calculates three anxiety scores by summing items and joins the two dataframes.

```{r}
df_questionnaire_tidy %>%
  group_by(participant_id) %>%
  summarize(
    cognitive_anxiety = sum(
      cognitive_anxiety_1, cognitive_anxiety_2, cognitive_anxiety_3
    ),
    somatic_anxiety = sum(
      somatic_anxiety_1, somatic_anxiety_2, somatic_anxiety_3
    ),
    behavioral_anxiety = sum(
      behavioral_anxiety_1, behavioral_anxiety_2, behavioral_anxiety_3
    )
  ) %>% 
  inner_join(
    df_uf_tidy, by = "participant_id"
  ) -> df_uf_anxiety

df_uf_anxiety
```

#### 3.5.1. Correlations Among Anxiety Scores

The following code block generates a correlation matrix of three anxiety scores.

```{r}
df_uf_anxiety %>%
  select(
    cognitive_anxiety,
    somatic_anxiety,
    behavioral_anxiety
  ) %>%
  chart.Correlation(histogram = T, method = "pearson", pch = 19)
```

The result showed that significant strong correlation between cognitive and somatic anxiety ($r=.78$). In addition, there was slightly significant medium correlation between somatic and behavioral anxiety ($r=.59$). Meanwhile, there was small correlation between cognitive and behavioral anxiety ($r=.48$), but statistical significance was not found.

#### 3.5.2. Correlations Between UF Measures and Cognitive Anxiety

The following code block generates a correlation matrix of UF measures and cognitive anxiety.

```{r}
df_uf_anxiety %>%
  select(-c(
    participant_id, 
    somatic_anxiety, 
    behavioral_anxiety
  )) %>%
  chart.Correlation(histogram = T, method = "pearson", pch = 19)
```

The result showed significant large correlation between cognitive anxiety and ECPR, suggesting that participants produced end-clause pauses more frequently when cognitive anxiety was lower.

#### 3.5.3. Correlations Between UF Measures and Somatic Anxiety

The following code block generates a correlation matrix of UF measures and somatic anxiety.

```{r}
df_uf_anxiety %>%
  select(-c(
    participant_id, 
    cognitive_anxiety, 
    behavioral_anxiety
  )) %>%
  chart.Correlation(histogram = T, method = "pearson", pch = 19)
```

The result did not show significant correlations between somatic anxiety and UF measures.

#### 3.5.4. Correlations Between UF Measures and Behavioral Anxiety

The following code block generates a correlation matrix of UF measures and behavioral anxiety.

```{r}
df_uf_anxiety %>%
  select(-c(
    participant_id, 
    cognitive_anxiety, 
    somatic_anxiety
  )) %>%
  chart.Correlation(histogram = T, method = "pearson", pch = 19)
```

The result identified significant large correlations between behavioral anxiety and MCPD ($r=-.76$) and ECPD ($r=.79$), suggesting that participants who perceived lower behavioral anxiety generate longer MCPD and ECPD.

------------------------------------------------------------------------

## 4. Regression Analyses

### 4.1. SF \~ Speaking Anxiety

The following code block constructs a regression model between AR and speaking anxiety.

```{r}
model_ar <- lm(
  articulation_rate ~ cognitive_anxiety + somatic_anxiety + behavioral_anxiety,
  df_uf_anxiety
)
```

The following code block shows the result.

```{r}
tab_model(model_ar, digits = 3)
```

The result did not show significant relationship between three anxiety scores and AR.

To examine the reliability of the current model, the following code block conducts post-hoc analysis of regression assumptions.

```{r}
check_model(model_ar)
```

The results suggested that the current model could be reliable.

### 4.2. BDF \~ Speaking Anxiety

#### 4.2.1. MCPR \~ Speaking Anxiety

The following code block constructs a regression model between MCPR and speaking anxiety.

```{r}
model_mcpr <- lm(
  mid_clause_pause_ratio ~ cognitive_anxiety + somatic_anxiety + behavioral_anxiety,
  df_uf_anxiety
)
```

The following code block shows the result.

```{r}
tab_model(model_mcpr, digits = 3)
```

The result did not show significant relationship between three anxiety scores and MCPR.

To examine the reliability of the current model, the following code block conducts post-hoc analysis of regression assumptions.

```{r}
check_model(model_mcpr)
```

The results suggested that the current model could be less reliable because the model could be affected by an outlier.

#### 4.2.2. ECPR \~ Speaking Anxiety

The following code block constructs a regression model between ECPR and speaking anxiety.

```{r}
model_ecpr <- lm(
  end_clause_pause_ratio ~ cognitive_anxiety + somatic_anxiety + behavioral_anxiety,
  df_uf_anxiety
)
```

The following code block shows the result.

```{r}
tab_model(model_ecpr, digits = 3)
```

The result showed slightly significant relationship between cognitive anxiety and ECPR ($p=.081$). The slope was $-.003$ and suggested that ECPR decreased $.003$ when cognitive anxiety increased $1$.

To examine the reliability of the current model, the following code block conducts post-hoc analysis of regression assumptions.

```{r}
check_model(model_ecpr)
```

The results suggested that the current model could be reliable.

#### 4.2.3. MCPD \~ Speaking Anxiety

The following code block constructs a regression model between MCPD and speaking anxiety.

```{r}
model_mcpd <- lm(
  mid_clause_p.dur ~ cognitive_anxiety + somatic_anxiety + behavioral_anxiety,
  df_uf_anxiety
)
```

The following code block shows the result.

```{r}
tab_model(model_mcpd, digits = 3)
```

The result showed a significant link between behavioral anxiety and MCPD ($p=.025$). The slope was $-.074$, suggesting that MCPD became $.074$ shorter when behavioral anxiety score increased $1$.

To examine the reliability of the current model, the following code block conducts post-hoc analysis of regression assumptions.

```{r}
check_model(model_mcpd)
```

The results suggested that the current model could be reliable.

#### 4.2.4. ECPD \~ Speaking Anxiety

The following code block constructs a regression model between AR and speaking anxiety.

```{r}
model_ecpd <- lm(
  end_clause_p.dur ~ cognitive_anxiety + somatic_anxiety + behavioral_anxiety,
  df_uf_anxiety
)
```

The following code block shows the result.

```{r}
tab_model(model_ecpd, digits = 3)
```

The result showed a significant link between behavioral anxiety and MCPD ($p=.020$). The slope was $-.072$, suggesting that ECPD became $.072$ shorter when behavioral anxiety score increased $1$.

To examine the reliability of the current model, the following code block conducts post-hoc analysis of regression assumptions.

```{r}
check_model(model_ecpd)
```

The results suggested that the current model could be reliable.

### 4.3. RF \~ Speaking Anxiety

The following code block constructs a regression model between DR and speaking anxiety.

```{r}
model_dr <- lm(
  disfluency_ratio ~ cognitive_anxiety + somatic_anxiety + behavioral_anxiety,
  df_uf_anxiety
)
```

The following code block shows the result.

```{r}
tab_model(model_dr, digits = 3)
```

The result did not show significant relationship between three anxiety scores and DR.

To examine the reliability of the current model, the following code block conducts post-hoc analysis of regression assumptions.

```{r}
check_model(model_dr)
```

The results suggested that the current model could be less reliable because the model could be affected by an outlier.

### 
